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Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 

Listing of Claims: 

1 . (Currently amended) A device for automatically identifying the language of a 
digital text, comprising: 

means for prestoring first character strings , including prefixes, suffixes and infixes, of 
different lengths from words of a plurality of predetermined languages, that occur 
frequently anywhere respectively in said words of [[a]]said plurality of predetermined 
languages and charactor i zo sa i d prodotorm i nod l anguages , 

means for prestoring second character strings of different lengths, that are 
atypical anywhere respectively in said words of said predetermined languages, 

means for analyzing words extracted from said digital text, thereby constructing 
for each extracted word ffallll a plurality of character strings contained in said extracted 
wo rd , including prefixes, suffixes and infixes, with overlap and different and hav i ng 
lengths lying between one character and the number of characters in said extracted 
word, and 

moans for compar i ng character strings conta i ned i n oxtractod words to prostorod 
character str i ngs i n ordor to dotorm i no scores assoc i ated w i th sa i d prodotorm i nod 
l a ngu a g es 

means for i nd i v i dua ll y comparing each of rfallll said plurality of character strings 
contained in each said extracted word to said first and second prestored character 
strings of oach dotorm i nod l anguago said predetermined languages so that whenever a 
first character string is found in said extracted word, a score associated with said 
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eaefrone determined language is increased by a first coefficient depending on the 
position of said first character string of said one determined language is found in said 
extracted word, and, whenever a second character string is found in said extracted 
word, said score is decreased by a respective second coefficient that is associated with 
said found second character string, said respective second coefficient increasing as the 
probability of said found second character string in said oach one determined language 
decreases, and 

means for comparing said scores for said text associated with said 
predetermined languages in order to determine the highest of said scores, which 
identifies the language of said text. 

2. (Cancelled) 

3. (Original) The device claimed in claim 1, wherein said first coefficient of a 
first character string in said extracted word depends on the frequency of said character 
string in said determined language. 

4. (Original) The device claimed in claim 1, wherein said first coefficient of a 
first character string in said extracted word depends on the length of said character 
string. 

5. (Original) The device claimed in claim 1, wherein said first coefficient of a 
first character string in said extracted word is equal to: 

PO (FR+LON), 

where 
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PO is a coefficient depending on the position of said first character string in said 
extracted word, 

FR is a coefficient depending on the frequency of said first character string in a 
determined language, and 

LON is a coefficient depending on the length of said first character string. 

6. (Previously presented) The device claimed in claim 1, comprising 
comparator means for comparing each of said extracted words from said text with 
frequent words in said determined language and initially listed in storage means so that 
whenever a frequent word is found in said text, said score for said determined language 
is increased only by a coefficient depending on the frequency of said extracted word in 
said determined language. 

7. (Previously presented) The device claimed in claim 1, comprising 
comparator means for comparing each of said extracted words from said text with 
frequent words in said determined language and initially listed in storage means so that 
whenever a frequent word is found in said text, said score for said determined language 
is increased only by a coefficient depending on the length of said frequent word. 

8. (Currently amended) A method of automatically identifying the language of a 
digital text, comprising the steps of: 

prestoring first character strings , including prefixes, suffixes and infixes, of 
different lengths from words of a plurality of predetermined languages, that occur 
frequently anywhere respectively in words of [[a]]said plurality of predetermined 
languages that charactor i zo sa i d prodotorm i nod l anguages ; 
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prestoring second character strings of different lengths, that are atypical 
anywhere respectively in said words of said predetermined languages; 

analyzing words extracted from said digital text, thereby constructing for each 
extracted word ffallll a plurality of character strings contained in said extracted word A 
including prefixes, suffixes and infixes, with overlap and different and hav i ng lengths 
lying between one character and the number of characters in said extracted word, and 

compar i ng character str i ngs contained in oxtractod words to prostorod character 
str i ngs i n order to dotorm i no scores associated w i th sa i d prodotorm i nod l anguages, 

i nd i v i dua ll y comparing each of rfallll said plurality of character strings contained in 
each said extracted word to said first and second prestored character strings of each 
dotorm i nod l anguago said predetermined languages so that whenever a first character 
string is found in said extracted word, a score associated with oach sa i d one determined 
language is increased by a first coefficient depending on the position of said first 
character string found in said extracted word, and, whenever a second character string 
of said one determined language is found in said extracted word, said score is 
decreased by a respective second coefficient that is associated with said found second 
character string, said respective second coefficient increasing as the probability of said 
found second character string in oach said one determined language decreases, and 

comparing said scores for said text associated with said predetermined 
languages in order to determine the highest of said scores, which identifies the 
language of said text. 

9. (Cancelled) 
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10. (Previously presented) The method of claim 8, wherein said first coefficient 
of a first character string in said extracted word is equal to: 

PO (FR+LON), 

where 

PO is a coefficient depending on the position of said first character string in said 
extracted word, 

FR is a coefficient depending on the frequency of said first character string in a 
determined language, and 

LON is a coefficient depending on the length of said first character string. 

11. (cancelled) 
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